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ABSTRACT 

Uracil-DNA glycosylase (UDG) is a key repair 
enzyme responsible for removing uracil residues 
from DNA. Interestingly, UDG is the only enzyme 
known to be inhibited by two different DNA mimic 
proteins: p56 encoded by the Bacillus subtilis phage 
<t>29 and the well-characterized protein Ugi encoded 
by the B. subtilis phage PBS1/PBS2. Atomic-reso- 
lution crystal structures of the B. subtilis UDG 
both free and in complex with p56, combined with 
site-directed mutagenesis analysis, allowed us to 
identify the key amino acid residues required for 
enzyme activity, DNA binding and complex forma- 
tion. An important requirement for complex forma- 
tion is the recognition carried out by p56 of the 
protruding Phe191 residue from B. subtilis UDG, 
whose side-chain is inserted into the DNA minor 
groove to replace the flipped-out uracil. A compara- 
tive analysis of both p56 and Ugi inhibitors enabled 
us to identify their common and distinctive features. 
Thereby, our results provide an insight into how 
two DNA mimic proteins with different structural 
and biochemical properties are able to specifically 
block the DNA-binding domain of the same enzyme. 

INTRODUCTION 

Genomic DNA is continuously exposed to damage by 
internal or external agents, which can generate a variety 
of DNA lesions threatening genome integrity and cell 



viability. To prevent the deleterious effects caused by 
DNA damage, organisms have developed a number of 
DNA repair mechanisms (1-3). Uracil, a base normally 
found in RNA, is one of the most frequent lesions in 
genomic DNA. Uracil may arise in DNA either by 
misincorporation of deoxyuridine monophosphate 
(dUMP) instead of deoxythymidine monophosphate 
(dTMP) during DNA synthesis or by spontaneous de- 
amination of cytosine in DNA. Most DNA polymerases 
are able to incorporate dUMP and dTMP with similar 
efficiency (4,5) producing U:A pairs that are not directly 
mutagenic, but may become genotoxic by impeding 
sequence recognition carried out by regulatory proteins 
(6). On the other hand, cytosine deamination can lead to 
GC^AT transition mutations after the next round of rep- 
lication posing a serious threat to genome integrity (7,8). 

Uracil-DNA glycosylases (UDGs) are the enzymes re- 
sponsible for removing uracil residues from DNA. UDGs 
initiate the base excision repair pathway by hydrolysing 
the N-glycosidic bond between the uracil residue and the 
deoxyribose sugar of the DNA backbone generating an 
apurinic-apyrimidinic site (9). UDGs have been classified 
into four distinct families (10). Members of Family- 1 are 
ubiquitous UDG proteins that are able to excise uracil in 
both single- and double-stranded DNA (10). 

Several structural studies of UDG in complex with 
DNA have led to propose a complex mechanism of 
action for this enzyme (11-16). These analyses revealed 
that UDGs bind, kink and compress the DNA 
backbone via the action of highly conserved Ser-Pro 
loops (LII, LIV and LV) while scanning the minor 
groove for a uracil lesion. The enzyme is hypothesized 



*To whom correspondence should be addressed. Tel: +34 91 1964675; Fax: +34 91 1964420; Email: msalas@cbm.uam.es 
Correspondence may also be addressed to Beatriz Gonzalez. Tel: +34 91 5619400; Fax: +34 91 5642431; Email: xbeatriz@iqfr.csic.es 

The authors wish it to be known that, in their opinion, the first two authors should be regarded as joint First Authors. 

© The Author(s) 2013. Published by Oxford University Press. 

This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/ 
by-nc/3.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial 
re-use, please contact journals.permissions@oup.com 



6762 Nucleic Acids Research, 2013, Vol. 41, No. 13 



to induce a further compression of the DNA backbone 
flanking the uracil residue, resulting in the flipping of 
the uracil out of the DNA helix and into the active site 
pocket of UDG. The hydrophobic side chain of a 
conserved leucine, Leu272 in human UDG (HsUDG) 
and Leul91 in Escherichia coli UDG (EcUDG), is 
inserted into the site vacated by the flipped-out uracil. 
This protruding leucine partially restores base-stacking 
interactions and probably functions as a 'door stop' for 
preventing the return of the flipped-out uracil into the 
DNA helix, as suggested (17). 

The binding of UDG to uracil is proposed to induce 
electron flow from 04 f of the deoxyribose to 02 of 
uracil. This negative charge is enzymatically stabilized by 
a neutral conserved histidine (HI 87 in EcUDG, H268 in 
HsUDG). A weakly nucleophilic water molecule com- 
pletes hydrolysis. Protonation on Nl following bond 
cleavage gives the more stable amide and completes the 
reaction. As a result, a free uracil molecule and an abasic 
site in the DNA are obtained (6,14). 

Despite the importance of removing uracil from DNA 
in the maintenance of genome stability, some proteins are 
known to naturally inhibit the repair activity of UDGs. 
Particularly, uracil-containing DNA bacteriophages have 
developed specialized strategies to counteract the cellular 
uracil excision repair pathway to survive and replicate in 
their hosts. For instance, phages PBS1 and PBS2 encode 
an inhibitor of Bacillus subtilis UDG (BsUDG) called Ugi, 
which is critical to preserve the uracil residues 
incorporated into the uracil-containing phage DNA (18). 
Ugi is a small (84 amino acids) highly acidic protein that is 
able to form a physiologically irreversible complex with a 
variety of UDG proteins in 1:1 molar stoichiometry by 
mimicking enzyme-DNA interactions (19-22). 

Recently, we have reported the identification of a novel 
inhibitor encoded by the B. subtilis phage cj>29, called 
protein p56, which is able to bind and block the host 
UDG activity (23). Although the 4)29 genome does not 
contain uracil residues, p56, a small (56 amino acids) 
highly acidic protein has been proposed to prevent the dele- 
terious effects caused by the host UDG activity in the 4>29 
genome integrity, if uracil is removed from the replicative 
intermediates. The capacity of p56 for blocking the DNA- 
binding domain of UDG and the ability of Ugi to replace 
p56 previously bound to UDG suggests that p56 is able to 
inhibit UDG by mimicking DNA properties (24). 

Bacteriophage proteins p56 and Ugi have likely evolved 
to successfully inhibit the same enzyme for preserving 
virus viability. Currently, only a few proteins with DNA 
mimic features have been characterized (21). Furthermore, 
the capacity of these proteins to show an effective mimicry 
of the interactions displayed by DNA with their targets 
without resulting in cross-reactivity with other DNA- 
binding enzymes remains elusive. 

In this study, we sought to explore the mechanisms 
underlying BsUDG inhibition by p56. The data obtained 
from the atomic-resolution crystal structure of BsUDG in 
complex with p56, together with the results of site-directed 
mutagenesis analysis, allowed us to determine the most 
relevant residues involved in this interaction. Thus, our 
results reveal new structural and functional insights into 



the mechanism of BsUDG inhibition by p56 and provide 
additional evidence for understanding the interaction of 
DNA mimic proteins with their targets. 

MATERIALS AND METHODS 

DNA substrates 

Oligonucleotide ssDNA-U16 ^-CTGCAGCTGATGCG 
CUGTACGGATCCCC-GGGTAC-3) containing a 
single uracil residue at position 16 was purified electro- 
phoretically on 8M urea/20% polyacrylamide gels and 
then 5'-end labelled using [y- 32 P]-adenosine triphosphate 
(3000 Ci mmol-1) (Perkin-Elmer Life Science) and T4 
polynucleotide kinase (New England Biolabs). To 
generate dsDNA substrates, the 5 / - 32 P-labelled ssDNA- 
U16 oligonucleotides were annealed to complementary 
non-labelled oligonucleotides (34 mer), containing a 
guanine residue opposite to uracil. Hybridizations were 
performed in the presence of 60 mM Tris-HCl (pH 7.5) 
and 0.2 M NaCl for 10 min at 70°C and then slowly cooled 
to room temperature. Oligonucleotides were obtained 
from Isogen Bioscience BV. 

Site-directed mutagenesis of feUDG, GST-feUDG 
and p56 

Protein mutants were obtained by using the QuickChange 
site-directed mutagenesis kit provided by Stratagene. To 
obtain BsUDG mutants, plasmid pT7-4-UDG-wt was 
used as a template for the mutagenesis reaction with 
specific oligonucleotides. The PCR reaction was per- 
formed using PfuTurboTM DNA polymerase, and then 
the product was treated with Dpnl endonuclease. The 
amplified DNA was transformed into E. coli BL21 
(DE3) competent cells. The presence of the desired 
mutation was confirmed by sequencing the entire gene. 
The same protocol was performed to generate mutants 
of Glutathione S-Transferase (GST)-^UDG (GST was 
attached to the N-terminal region of BsUDG) and of 
p56 except for the template vector: pGEX-2T-UDG-wt 
for GST-&UDG mutants and pT7-3-p56-wt for p56 
mutants. Both plasmids were previously described (25). 
The specific oligonucleotides used in the mutagenesis re- 
actions are listed in Supplementary Table S 1 . 

Determination of UDG activity 

UDG activity was determined as described (25). To test 
the BsUDG inhibition by p56, the minimal BsUDG 
amount needed to obtain 50% cleavage of substrate was 
incubated with the indicated amount of p56 proteins 
at 4°C for 15 min, and then added to a reaction buffer 
containing 32 P-labelled substrate. 

DNA gel retardation assay 

The DNA-binding ability of the GST-iMJDG wild-type 
and the mutants were examined by gel retardation assays. 
Proteins were incubated with 34 bp 5 / - 32 P-labelled dsDNA 
U:G (0.7 nM) in a final volume of 20 |il of binding buffer 
[50 mM Tris-HCl (pH 7.5), 4% (vol/vol) glycerol, 1 mM 
dithiothreitol (DTT), 0.1 mg/ml of bovine serum albumin] 
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for 5min at 4°C. Then the samples were analysed by non- 
denaturing polyacrylamide gel electrophoresis (6% poly- 
acrylamide), which was pre-run at 30 mA for 20 min. 
Electrophoresis was carried out at 4°C in TAE buffer 
(40 mM Tris, ImM EDTA, pH set to 7.5 with acetic 
acid) at 30 mA for 30 min. Gels were vacuum dried, and 
the radioactive bands were detected by autoradiography 
and quantified by densitometry. 

GST pull-down assay 

The pull-down assay was used to determine the interaction 
between GST-ifaUDG and p56 proteins. A mixture con- 
taining 30 jig of GST-ifaUDG (or GST as a control) and 
3 jig of p56 in a final volume of 100 ul of buffer PBS [137 
mM NaCl, 2.7 mM KC1, 10 mM Na 2 HP0 4 and 2mM 
KH 2 P0 4 (pH 7.4)] was incubated for 15 min on ice (all 
the following steps were carried out at 4°C). After that, 
the samples were mixed gently with 60 ul of a 50% slurry 
of glutathione-sepharose beads (Glutathione Sepharose 4 
Fast Flow from GE Healthcare), and then 700 ul of PBS 
was added to ensure an efficient mixing of reagents. The 
samples were incubated overnight with an end-over-end 
mixing and then centrifuged at 2000 rpm for 5 min. The 
beads were washed once with 10 volumes of PBS and 
mixed end-over-end during 20 min and then once more 
with TEN600 buffer [20 mM Tris-HCl (pH 7.5), 0.1 mM 
EDTA and 600 mM NaCl] to remove unbound proteins. 
The elution of the GST-&UDG and the bound p56 
proteins was performed by boiling the samples in loading 
buffer [37 mM Tris-HCl (pH 6.8), 2% (w/vol) sodium 
dodecyl sulfate, 4% (vol/vol) (3-mercaptoethanol 
and 13% (vol/vol) glycerol] during 5 min. Samples 
were analysed by 16% (w/vol) polyacrylamide 
Tris-Tricine gels and visualized using Coomassie blue 
staining. 

Crystallization of 2faUDG-p56 complex and BsUDG 

For &UDG-p56 complex crystallization, ammonium 
sulphate pellets of UDG from B. subtilis and p56 from 
4>29 were resuspended in buffer A [20 mM Tris-HCl 
(pH 7.5) and 1 mM EDTA], mixed in a 1:1.5 molar ratio 
to a final concentration of 1 1 .5 mg/ml and dialyzed against 
buffer A. Inital crystallization trials were done using a 
NanoDrop robot (Innovadyne Technologies Inc.). Two 
commercial screens, Crystal Screen I (Hampton 
Research) and JCSG+ screen (Qiagen), yielded prelimin- 
ary crystals in different conditions that were further 
optimized. The best crystals obtained for ifaUDG-p56 
complex appeared at 4°C after 7-8 days in sitting drops 
prepared by mixing 1 ul of the protein complex plus 1 ul of 
reservoir solution and using 19% PEG 8000, 0.1 M Tris- 
HCl (pH 8.5), 0.25 M MgCl 2 as precipitant. The crystals 
were cryoprotected using the crystallization condition plus 
20% glycerol and flash frozen in liquid nitrogen previously 
to data collection experiment. Data were collected at 
100K on ID23-2 beamline of the European Synchrotron 
Radiation Facility (ESRF, Grenoble, France). 

For BsVDG crystallization, an ammonium sulphate 
pellet of the protein was resuspended in buffer A to a 
final concentration of 14.7 mg/ml and dialyzed against 



the same buffer. Crystallization trials were performed as 
before using two commercial screens, (PACT Suite and 
JCSG+, both from Qiagen). Tiny rod crystals appeared 
after 3 weeks in a condition containing 0.2 M NaF and 
20% PEG 3350 at 18°C. Further optimization of this con- 
dition did not progress; therefore, these crystals were 
cryoprotected using the crystallization condition supple- 
mented with 20% glycerol. A data set was collected at 
100 K in the ID29 beamline of the ESRF. 

Structure determination, refinement and analysis of 
feUDG-p56 complex and feUDG 

&UDG-p56 crystal data processing and reduction were 
done with iMosflm (26) and Scala (27) from the CCP4 
suite (28). Crystals belong to the space group P2 1 2 1 2 1 , 
with unit o cell parameters a = 53.77, b = 66.27, 
c = 102.28 A. There is one molecule of BsUDG and a 
dimer of p 56 in the asymmetric unit. The crystal structure 
of the ifaUDG-p56 complex was determined by 
Molecular Replacement using MolRep (29) and UDG 
from E. coli as initial search model (Protein Data Bank 
(PDB) code 2eug, 53% sequence identity). The model 
obtained was mutated to BsUDG sequence and refined 
with the rigid body protocol of Refmac5 (30). This pro- 
cedure led to a model that provided initial phases of good 
quality to trace the p56 dimer in the electron density 
maps by Buccaneer (31). Cycles of manual building 
using Coot (32) and refinement using Refmac5 (30) 
were performed. The electron density map showed 
double conformation for several protein residues as well 
as density for a chloride ion and a glycerol molecule, 
which have been modelled. Statistics for data processing 
and structure solution and refinement are shown in 
Supplementary Table S2. Final model includes residues 
3-225 of BsUDG, residues 8-55 of p56 subunit A and 
residues 4-56 of p56 subunit B. 

BsUDG crystal data set was processed with X-Ray 
Detector Software (33), and data were reduced with 
Scala (27) from the CCP4 suite (28). Crystals belong to 
space 1222 group, with unit cell parameters a = 65.30, 
b = 79.79, c = 97.36 A and one protein molecule in the 
asymmetric unit. The structure was determined by 
Molecular Replacement using Phaser (34). The high-reso- 
lution structure of UDG from B. subtilis as seen in 
complex with p56 protein, was used as initial search 
model. Structure refinement was performed alternating 
cycles of model rebuilding with Coot (32) and refinement 
with Refmac5 (30). Statistics for data reduction and re- 
finement are shown in Supplementary Table S2. 

Coordinates validation has been performed with 
PROCHECK (35). Phe78, a residue involved in conform- 
ing the uracil-binding pocket, violates the Ramachandran 
distribution, a characteristic conserved in other UDG 
known structures. The accessible surface areas and inter- 
face analysis have been computed with PISA server (36), 
and the pictures have been made with PyMOL (37). 

Modelling of feUDG-DNA complex 

HsUDG was described to exist in open (free) and closed 
(complexed with DNA) conformations. Therefore, to 
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model the ifaUDG-DNA complex, BsVDG coordinates 
were separated in two domains [domain I (8-79; 
120-154) and domain II (80-111; 159-225], as their 
equivalent domains in several UDG isoforms are 
reported to close on DNA binding (38). Each domain 
was superposed to HsUDG (50% sequence identity), as 
obtained in complex with DNA (pdb code lemh). The 
linkers between domains I and II, and between segments 
in domain II were regularized using Coot (32). The model 
obtained presented a few close contacts at the protein- 
protein interface. Then, to improve the model quality, it 
was subjected to structure idealization using Refmac (30). 
The final model presents 90% residues inside the most 
favoured regions of the Ramachandran distribution, and 
none close contacts, as analysed by PROCHECK (35), 
conserving the general features of the 7/sUDG-DNA 
complex. 

The structural data of proteins from this publication 
have been submitted to the Protein Data Bank and 
assigned the identifier accession codes 3zoq and 3zor for 
2faUDG-p56 and BsUDG structures, respectively. 



RESULTS 

Structure of feUDG-p56 complex 

We have crystallized the complex formed between the 
phage 4>29 protein p56 and the UDG enzyme from 
B. subtilis (BsVDG) at 1.45 A resolution (Figure 1A and 
Supplementary Table S2; see amino acid sequence in 
Supplementary Figure SI). The folding of UDG from 
B. subtilis is conserved with respect to UDGs from other 
organisms previously crystallized (11-16). UDG is a 
classic oc/P protein that consists of four parallel (3-strands 
forming a (3-sheet flanked by several oc-helices on both 
sides (Figure IB). The active site is situated in a wide 
cleft spanning the C-terminus of the (3-sheet whose inner 
strands ((31 and (33) are noticeably opened. This cavity is 
surrounded by several conserved loops named LI, LII, 
LIII, LIV and LV (Figure IB). 

Protein p56, as seen in the ifaUDG-p56 complex, forms 
a tight dimer (subunits A and B; rmsd = 0.423 A). This 
result is in agreement with previous analytical centrifuga- 
tion studies showing that p56 formed dimers under 




Figure 1. Structure of ifaUDG-p56 crystal complex. (A) Protein p56 subunits are represented as magenta (subunit A) and blue (subunit B) cartoons, 
whereas BsUDG is shown as olive surface, transparent on residue Phel91. This code of colours is kept throughout the manuscript. (B) Protein p56 is 
shown as surface and BsUDG as cartoon, showing the loops LI-LV in blue colour. (C) Zoom of figure (A), in a perpendicular orientation. 
(D) Electrostatic surface of: 2?sUDG-p56 complex (left), the interface being highlighted by a dashed line; Bs\JT>G (middle), showing the basic 
character of the p56 binding surface, and p56 (right), showing the acidic character of the BsUDG binding surface in which the position of acidic 
residues is shown. 
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physiological conditions (24) and with Isothermal 
Titration Calorimetry analysis revealing that p56 com- 
plexed with BsVDG in 2:1 stoichiometry (39). Each 
monomer consists of three antiparallel [3-strands and one 
oc-helix, where (31 is the inner strand, and al is linking the 
outer strands [32 and [33 (Figure 1C). These secondary 
structural elements are connected by several loops 
named LI, L2 and L3. The elements from p56 subunits 
A and B are named, respectively, with the suffixes a and b 
therein, to distinguish them from each other. The two p56 
subunits, related by a pseudo 2-fold axis, form a dimer by 
symmetric interactions through ocla-oclb and [33a-[33b 
producing an extended P-sheet with six antiparallel 
Pstrands (Figure 1C). 

The p56 regions involved in BsUDG recognition exhibit 
a negative electrostatic potential, compatible with the 
highly basic character of the DNA-binding loops at the 
enzyme interface (Figure ID). Interestingly, p56 subunits 
bind differently to BsUDG generating a non-symmetrical 
complex. Subunit A of protein p56 produces a much more 
extensive interaction with BsUDG contributing with 
1510 A 2 to the total buried surface of the complex 
calculated in 1815 A 2 (Supplementary Table S3, see 
Figure 2A). The recognition between BsUDG and p56 
involves multiple polar (summarized in Supplementary 
Table S4) and hydrophobic interactions (Figure 2A). 
Our analysis revealed that the main feature of the 
complex formation is the recognition of Phel91 from 
BsUDG by p56. Both p56 subunits use their al as 
clamps to completely enclose Phel91 in a hydrophobic 
pocket located at the dimerization interface (Figure 2B). 
Residues Glu37, Phe36 and Tyr40 from both p56 al 
outline this pocket, representing the only symmetrical 
interaction of the p56 dimer with BsUDG. In addition, 
Glu37 and Tyr40 play an essential role in p56 dimerization 
(39), as the two hydrogen bonds between Glu37a-Tyr40b 
and Glu37b-Tyr40a are crucial in maintaining its 
architecture. 

The interface between p56 subunit B (p56B) and 
BsUDG is formed exclusively by the hydrophobic 
contact through Phel91, plus the polar interaction 
between Ser34b and Glu37b from p56 and Argl94 from 
BsUDG (Figure 2A). However, as aforementioned, 
subunit A of p56 (p56A) produces a much more 
extended interaction with BsUDG. Besides the interaction 
mediated by the BsUDG Phel91, other hydrophobic 
contacts were observed along the whole 2faUDG-p56A 
interface (Figure 2C) like those produced between 
Tyr67, His68 and His 187 from BsUDG and Asn42a, 
His28a and Val38a from p56A. Moreover, several hydro- 
phobic contacts with p56A involve BsUDG prolines, in 
particular Pro87 and Pro88 from the proline rich loop, 
and Pro 190, which interact with the region Y40a-N42a 
of p56. 

Nevertheless, there are also multiple polar interactions 
implicated in the complex formation (Supplementary 
Table S4). In all, 11 of 19 total polar interactions found 
in the complex involve residues from p56 al, either ala 
(Figure 2D) or alb (Figure 2A), but other p56 regions also 
present relevant interactions (Figure 2E). Among the 
polar interactions, those produced between BsUDG LII 



(Pro88 and Ser89) and p56 al (Gly41a) are particularly 
significant, as they involve backbone links (Figure 2D). In 
addition, four basic residues of BsUDG, Lys85, Lysl37, 
Argl66 and Argl94 form several ion pairs that are 
implicated in the complex formation (Figure 2F). 
Specifically, Argl66 and Argl94 interact with Glu37 in 
both ala and alb helices, respectively, whereas Lysl85 
forms an ion net with two acidic residues of p56, 
Asp 19a and Asp45a, both of which are outside of al 
(Figure 2 A and 2D-F). In addition to the direct polar 
links, multiple interactions (17 < 3. 2 A) through water 
molecules are formed within the ifaUDG-p56 interface, 
increasing the strength of the interaction produced 
between both proteins. 

In summary, the main feature of the ifaUDG-p56 
complex is the recognition of BsUDG Phel91 within a 
hydrophobic pocket at the p56 dimer interface. BsUDG 
residues from LV and the remaining DNA-binding loops 
seal this interaction through polar and hydrophobic inter- 
actions established mainly with p56 ala. 

BsUDG and p56 structural changes on complex formation 

To characterize the structural changes that occur in 
BsUDG on p56 binding, we have crystallized UDG 
from B. subtilis and solved its structure at 2.9 A resolution, 
using the high resolution BsUDG model as found in the 
ifaUDG-p56 complex. The final refined BsUDG model 
showed unambiguous density for the whole molecule. 
The superposition of the BsUDG structure free or in 
complex with p56 (rmsd = 0.382 A) reveals variations in 
two of the five BsUDG loops mentioned previously 
(Figure 3A). The most significant change occurs at LV, 
which contains the Phel91, a residue that is completely 
buried in a hydrophobic pocket in the complex while it 
is exposed to the solvent and partially disordered in the 
absence of p56. Loop II, containing several prolines, also 
displays an appreciable variation reflecting its intrinsic 
flexibility. The role of proline residues in protein-protein 
complex formation has been described in previous studies 
(40,41). Despite these modifications, the global BsUDG 
structure does not seem to change significantly on p56 
binding, revealing that p56 inhibits the enzyme in its free 
state conformation. 

Similarly, the p56 fold and dimerization features found 
in the 2faUDG-p56 crystal complex are essentially the 
same as those described for free p56 recently solved by 
nuclear magnetic resonance (NMR) (39) (Figure 3B) 
(rmsd = 1.16 A with respect to the top NMR model: 
pdb code 21e2). 

In summary, the interaction surface is nearly preformed 
in both proteins, suggesting that no additional energetic 
cost is required to form the complex, and providing a 
possible explanation for its tight binding (24). 

Effect of BsUDG mutations on enzymatic activity and 
DNA-binding capacity 

Several data suggest that p56 mimics the interactions 
between BsUDG and the phosphate backbone of the 
DNA. To confirm this possibility, we obtained a structural 
model of the ifaUDG-DNA complex (Figure 4) using the 
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Figure 2. 2faUDG-p56 interaction. (A) p56-ifaUDG interface showing all the interactions produced in the complex. BsUDG is shown as transparent 
surface, highlighting the interacting loops, while p56 a-helices are shown as cartoons. The residues making interactions are shown as sticks. 
(B) BsUDG Phel91 recognition by A and B subunits of p56. Phe 191 binds into a symmetrical hydrophobic pocket situated in the p56 dimer 
interface. (C) Scheme of ifaUDG-p56 hydrophobic interactions. (D) Scheme of polar interactions formed between BsUDG and al of p56A. 
(E) Scheme of polar interactions produced between BsUDG and p56A elements outside al. (F) Surface of the 2faUDG-p56 complex, showing 
three basic residues of BsUDG that make ionic pairs with p56 acidic residues. Residue Lysl37 cannot be shown in this orientation. 



7/sUDG-DNA complex as a template (pdb code: lemh, 
50% sequence identity). This model allowed us to propose 
the most relevant BsUDG residues involved in DNA 
binding and to compare them with those implicated in 
p56 binding. To gain a further insight into the biochemical 
basis of the complex formation, we performed a site- 
directed mutational analysis on BsUDG. 

BsUDG mutants containing the single mutation Q64A, 
H68A, K85S, R166S, H187A, S189A, P190A, F191A 



or R194S, or the double mutation P87A/P88A were 
expressed and purified to apparent homogeneity, and 
their enzymatic activity as well as their DNA-binding 
ability were determined. The UDG activity was signifi- 
cantly decreased in all the mutants (Figure 5A and 
Supplementary Figure S2), although they can be classified 
into three groups according to their residual enzymatic 
capacity. The first group is formed by mutants K85S 
and R166S with a uracil excision capacity ~3-fold 



Nucleic Acids Research, 2013, Vol. 41, No. 13 6767 



Lll A w p56 B 





SsUDG 



Figure 3. Changes produced in BsUDG and p56 upon complex formation. (A) Superposition of BsUDG free and in complex with p56. ifaUDG is 
represented as Ca trace in colour blue (free) and green (p56-bound). Protein p56 is shown as cartoons surrounded by a transparent surface. The most 
variable regions are marked by arrows, in which Phel91 is shown as sticks. (B) Superposition of p56 structures from ifaUDG-p56 crystal complex 
and from NMR structure (coloured grey). 
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Figure 4. Model of IfaUDG-DNA interacting surface. DNA backbone 
is shown as orange cartoons, whereas the bases are shown as blue 
sticks. The flipped uracil is shown as spheres. Oxygen and nitrogen 
atoms are coloured in red and blue, respectively. The binding interface 
of BsUDG is shown as transparent cartoons highlighting the interact- 
ing loops in green colours. The residues mutated for this work are 
shown as green sticks in the above loops. 



reduced with respect to wild-type BsUDG (Figure 5A). 
These residues are not conserved among UDGs from 
different organisms and are located at the boundaries of 
the ifaUDG-p56 complex (Figure 2F). Curiously, electro- 
phoretic mobility shift assays showed that K85S and 
R166S were the mutants most affected in DNA binding 
(Figure 5B and Supplementary Figure S3), as they 
retained only <40% of the wild-type protein binding. 
Previous studies have proposed that basic residues on 
HsUDG surface have a role in attracting DNA (11). 
Thus, the moderate reduction in enzymatic activity in 



these mutants could be a consequence of the decrease in 
DNA binding. The second group of mutants, formed by 
H68A, P87A/P88A, S189A and R194S, have an activity 
decrease ~6-l 0-fold when compared with wild-type 
BsUDG. With the exception of Argl94, these residues 
are located at the entrance of the active site to ensure a 
proper DNA positioning, in particular that of uracil. The 
equivalent residues of His68 and Serl89 in HsUDG 
(Hisl48 and Ser270, respectively) bind to the DNA phos- 
phates, whereas conserved prolines 87 and 88 (Pro 167 and 
Pro 168 in HsUDG) pack against DNA to hold the 
5 ; -phosphate from the uracil-containing nucleotide. 
Although far from the active site, the equivalent residue 
of Argl94 in HsUDG (Tyr275) stacks with DNA bases 
making a hydrogen bonding interaction (42). Despite the 
fact that BsUDG activity was clearly affected, the 
mutation of these residues did not produce a substantial 
decrease in DNA binding (Figure 5B and Supplementary 
Figure S3) probably because the specific recognition 
between BsUDG and DNA involves multiple interactions 
through several residues that contribute to complex for- 
mation. The third group of mutants (Q64A, H187A, 
PI 90 A and F191A) was severely impaired, as they 
showed no detectable UDG activity (except in the case 
of PI 90 A, which retained a 4% activity) even when the 
amount of mutant proteins was increased 10-fold 
compared with that of the wild-type. Other studies 
demonstrated that HsUDG residues Gin 144 and His268 
(equivalent to BsUDG Gln64 and His 187) interact with 
uracil oxygens through its main and side chains, respect- 
ively. In addition, Glnl44 is necessary to orient the uracil 
properly into the active site through its side chain, whereas 
His268, a catalytic residue proposed to be involved in the 
uracil excision process (11), coordinates a DNA phos- 
phate through its main chain. Phel91 is also essential 
for BsUDG function and is equivalent to Leu272 of 
HsUDG. The side chain of Leu272 is inserted into the 
DNA minor groove protruding into the DNA base stack 
to replace the flipped-out uracil nucleotide. The 
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UDG 



100 < 1 13 35 13 37 <1 10 4 <1 16 % 



B 180 




wt Q64A H68A K85S 8 ° A R166S H187A S189A P190A F191A R194S UDG 
100 72 53 38 96 35 48 93 88 161 73 % 




UDG 



100 85 88 88 106 97 5 17 9 2 28 % 

Figure 5. Influence of the mutated residues of l?sUDG on the enzymatic activity, DNA-binding capacity and complex formation with p56. 
(A) Enzymatic activity of IfaUDG mutants compared with wild-type BsUDG. The 34 bp 5 / - 32 P-labelled ssDNA substrate containing a single 
uracil residue at position 16 was incubated with the different proteins. Bands corresponding to DNA substrate and DNA product were monitored 
by autoradiography. (B) The DNA-binding ability of the GST-ifaUDG wild-type and the mutants was examined by gel retardation assays. Proteins 
were incubated in binding buffer with 5 7 - 3 P-labelled dsDNA U:G substrate, and the samples were analysed by non-denaturing polyacrylamide gel 
electrophoresis. Bands were detected by autoradiography. (C) Role of the mutated residues of GST-lfaUDG in the interaction with protein p56 as 
determined by pull-down assay. After incubation, the quantity of p56 bound to GST-2?sUDG proteins was determined by Tris-Tricine polyacryl- 
amide gel electrophoresis and visualized using Coomassie blue staining. Bands were quantified by densitometry. The percentage of UDG activity (A), 
DNA-binding capacity (B) or p56-binding ability (C) for the mutants was calculated with respect to the wild-type BsUDG. Each bar represents the 
mean value for three independent experiments with error bars indicating the standard deviations. 



replacement of Leu272 with alanine (L272A) practically 
abolished UDG activity (14), as it occurs in BsUDG 
F191A mutant. The contiguous HsUDG Pro271 (equiva- 
lent to BsUDG Pro 190) packs with a base preceding the 
flipped nucleotide, ensuring a correct DNA location. All 



these mutants conserved at least ~50% of the wild- type 
ability to bind DNA, and therefore, it seems unlikely that 
the selected mutations have a significant influence on the 
structural stability of these proteins. In conclusion, all the 
BsUDG residues mutated from the &UDG-p56 interface 
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100 13 6 18 18 32 15 5 20 19 % 



B 120 




wt H28K H28S D35A D35S F36W V38E N42R D45R D45S p56 

100 1 25 27 42 44 6 7 12 75 % 

Figure 6. Role of the mutated residues of protein p56 in UDG inhibition and binding capacity to BsUDG. (A) Ability of the p56 mutants to inhibit 
the UDG activity. p56 mutants and wild-type BsUDG were incubated and then the 34 bp 5 / - 32 P-labelled ssDNA substrate containing a single uracil 
residue at position 16 was added to the reaction. Samples were subjected to electrophoresis in polyacrylamide gels, and bands were detected by 
autoradiography. (B) Influence of the mutated residues of p56 in the interaction with protein GST-lfaUDG as determined by pull-down assay. 
Following incubation the quantity of p56 bound to GST-lfaUDG proteins was examined by Tris-Tricine polyacrylamide gel electrophoresis and 
visualized using Coomassie blue staining. Bands were quantified by densitometry. The percentage of UDG inhibition (A) or UDG-binding capacity 
(B) for the mutants was calculated relative to the wild-type p56. Bar charts represent the average values for three independent experiments with 
standard deviations indicated. 



are important for enzyme activity. In particular, residues 
located at the uracil-binding pocket entrance (Gln64 and 
His 187) and Phel91, a residue that protrudes into the 
DNA minor groove, are essential, in agreement with the 
structural and functional findings of homologue residues 
in other UDGs (19). 

Amino acids involved in the binding of p56 to BsUDG and 
its inhibition 

To determine the ability of BsUDG mutants to form a 
complex with p56, we performed GST pull-down experi- 
ments. The amount of each GST-ifaUDG mutant in 
complex with p56 was measured following incubation 
under standard binding conditions as described in 
'Materials and Methods' section. As shown in Figure 5C 
and Supplementary Figure S4, the stability of the 
ifaUDG-p56 complex varies considerably among the 
mutants. Interestingly, the most affected BsUDG 
mutants (H187A, S189A, P190A, F191A and R194S) con- 
tained point mutations in residues located at loop V. This 
is consistent with the observation that the main interaction 
between BsUDG and p56 is produced through Phel91 
binding, a residue located in the middle of LV. 



Consequently, F191A almost prevents p56 binding. 
Other LV mutants, such as HI 87 A and P190A, also 
show a reduced p56 binding capacity. These residues are 
involved in hydrophobic contacts with ocla of p56 
(residues Val38a and Tyr40a, respectively), indicating the 
importance of these hydrophobic interactions in the 
complex formation. Mutants S189A and R194S still 
showed a significant reduction in p56 binding (84 and 
75%, respectively). Both mutated residues interact with 
Glu37 from p56 subunit A and B, respectively 
(Figure 2). Although the Serl89 interaction is produced 
through its main chain, the intramolecular contact 
produced between Hisl87 and Serl89 side chains can 
explain the large effect of the S189A mutation, as this 
residue seems to participate in a specific UDG surface 
configuration and/or His 187 positioning. 

In a complementary approach, we performed a muta- 
tional analysis of p56 to investigate the role of the selected 
residues in both complex stability and inhibitory capacity 
of the UDG activity. All the p56 mutants tested (H28K, 
H28S, D35A, D35S, F36W, V38E, N42R, D45R and 
D45S) were noticeably less efficient than the wild-type 
p56 at preventing the cleavage of uracil-containing sub- 
strate by BsUDG (Figure 6A and Supplementary Figure 
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S5). Specifically, these substitutions led to ~80% reduc- 
tion in the inhibitory activity with respect to the wild-type 
protein with the exception of mutant F36W that produced 
a 3-fold decrease in inhibition. However, Phe36 is 
expected to be essential in both p56 dimerization and 
Phel91 binding site formation (Figure 2B), suggesting 
that although substantial distortion may be introduced 
by the substitution, the tryptophan side chain probably 
keeps the main features of Phe36. The mutants on the 
acidic residues Asp35 and Asp45 exhibited a 5-fold inhib- 
ition decrease. Both residues form hydrogen bonds with 
BsUDG. Asp35a is able to form links to Asn64, His 167 
and Lysl37 from BsUDG (Figure 2D), whereas Asp45a 
interacts with Lys85 and also makes an intramolecular 
contact with Tyr24a (Figure 2E) contributing to the 
shaping of the interface. D45R and D45S showed 
similar activity, thus suggesting that arginines can be 
easily accommodated on the surface without causing any 
deleterious effects. The mutation of the hydrophobic 
residue Val38 presents a similar effect to the aforemen- 
tioned acidic residues. Val38a stacks to BsUDG Hisl87, 
and, unusually, it points to a polar cavity containing 
several water molecules (Supplementary Figure S6A). 
However, the V38E change probably affects its interaction 
with His 187 or the conformation of neighbour residues, 
decreasing the binding capacity. The most affected 
mutants are H28K, H28S and N42R. His28a makes 
both polar (Alal33) and hydrophobic (His68) contacts 
with BsUDG (Figure 2E and C). In addition, His28 is 
involved in the configuration of the interaction surface 
through intramolecular interactions with Leu39 and 
Glu26, residues that also interact with BsUDG 
(Supplementary Figure S6B). Finally, Asn42a is the only 
p56 residue that points towards the uracil-binding pocket, 
though it does not protrude deeply into the cavity. The 
lack of inhibition in the N42R mutant could be explained 
if the charge introduced is not well accommodated in a 
partially hydrophobic pocket, despite the fact that there 
seems to be enough room to accommodate a large residue. 
In addition, Asn42a is involved in several water-mediated 
interactions through its main and side chain, and its elim- 
ination could also affect binding. 

The significant decreased capacity of p56 mutants to 
inhibit BsUDG activity seems to reflect a reduced ability 
to bind to BsUDG (Figure 6B and Supplementary Figure 
S7), suggesting a role for these residues in the formation of 
the complex. Interestingly, p56 mutants showed relevant 
differences in their binding to BsUDG despite their fairly 
similar inhibitory capacity. Mutants can be classified into 
three groups according to their ability to form stable 
complexes with BsUDG. The first group includes the 
most affected mutants (H28K, V38E, N42R and D45R) 
that only form <10% of the complexes relative to the 
wild-type protein. The second one is constituted by 
mutants H28S, D35A, D35S and F36W that retain -25- 
45% of the wild-type p56 binding to BsUDG, whereas 
D45S, the only member of the third group, maintains 
—75% of the wild- type complex formation capacity. The 
high specificity of the p56 inhibition is based on its ability 
to bind to UDG more tightly than the DNA substrate 
(24). The moderate BsUDG binding capacity displayed 



by some p56 mutants does not seem to be enough to dis- 
sociate DNA from BsUDG, as they were not capable of 
producing any significant inhibition of UDG activity. 
Therefore, these results may suggest that p56 mutants, 
even those less severely affected in BsUDG binding, are 
not able to compete with DNA for binding to UDG as 
efficiently as the wild- type p56. 



DISCUSSION 

Uracil DNA-glycosylases are highly conserved enzymes 
that play a crucial role in maintaining genomic integrity 
by efficiently removing uracil from DNA (43). 
Nevertheless, two bacteriophage proteins, named Ugi 
and p56, have been characterized as natural inhibitors of 
UDGs from a variety of different organisms, including 
their host B. subtilis. Intriguingly, both inhibitors act as 
DNA mimic proteins that might have evolved to inacti- 
vate BsUDG activity, representing a viral defence mech- 
anism to overcome the host uracil excision repair 
pathway. Specifically, the phage PBSl/PBS2-encoded 
Ugi is essential for virus viability by impeding the 
excision of uracil residues from the unusual uracil-con- 
taining DNA of the phage (18). In addition, we have 
recently reported that phage 4>29 protein p56 prevents 
viral DNA replication impairment caused by the 
removal of any uracil erroneously incorporated into the 
replicative intermediates (24,44). 

In the present study, we have solved the crystal struc- 
tures of both uncomplexed and complexed BsUDG with 
phage 4>29 protein p56 to a resolution of 2.9 and 1.45 A, 
respectively, to analyse the structural basis for enzyme 
inhibition. The most significant interactions in the forma- 
tion of 2faUDG-p56 complex were identified and 
compared with those in the structural model of BsUDG- 
DNA. Moreover, we have taken advantage of the fact that 
UDG is the only enzyme known to be inhibited by two 
different DNA mimic proteins to compare both p56 and 
Ugi interactions with UDG (Figure 7A). This approach 
allowed us to identify the structural and biochemical 
properties shared by both inhibitors that enable them to 
block the DNA-binding domain of BsUDG. 

Previous studies have determined that the mechanism 
for uracil repairing by UDGs in diverse organisms 
included the displacement of the uracil lesion to an 
extrahelical position and the insertion into the DNA base 
stack of the side chain of a protruding leucine (Leu272 
in HsUDG, Leul91 in EcUDG) located at UDG loop V 
(14,38,45). Interestingly, our analysis revealed that the 
main feature of the ifaUDG-p56 complex formation is 
the recognition of the Phel91 residue from BsUDG by a 
specific hydrophobic pocket at the p56 dimer interface 
(Figure 7B). Residues located at the DNA-binding loops 
of BsUDG, particularly those in loop V, are responsible for 
sealing this interaction through polar and hydrophobic 
interactions (Figure 2A). Other studies have described 
that Ugi also envelops a protruding leucine residue from 
human and E. coli UDG (19,46,47). Functional character- 
ization of specific BsUDG mutants showed that the most 
important residues for p56 binding (His 187, Serl89, 
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Figure 7. Comparison of p56, Ugi and DNA-binding mode in UDG proteins. (A) Structure of 2faUDG-p56 (left), iicUDG-Ugi (middle) (pdb code: 
lugh) and //sUDG-DNA (right) (pdb code: 1 emh) complexes. //sUDG-Ugi complex keeps the general features of ifcUDG-Ugi. In all cases, the 
UDG protein is shown as electrostatic surface, and the inhibitory proteins or DNA are shown as cartoons in magenta/blue, green and olive colours, 
respectively. (B) Zoom showing the Phel91 (left) /Leul91 (middle) /Leu272 (right) binding pockets. UDGs are shown as cream transparent surface, 
and p56/Ugi/DNA elements as cartoons with the essential residues highlighted as sticks. (C) Detail showing the five UDG loops (LI-LV) involved in 
binding in different brown shades. Proteins p56, Ugi or DNA important elements for UDG binding are shown as transparent cartoons. Essential 
residues for complex formation are shown as sticks and the polar interactions marked as dashed lines. 



Pro 190 and Phel91) were also essential for the enzymatic 
activity (Figures 5 A and 6C). The equivalent residues in 
HsUDG are also implicated in the uracil excision 
process. In particular, it is thought that the insertion of 
the Leu272 side chain (Phel91 in BsUDG) into the DNA 
minor groove to replace the flipped-out uracil allows 
His268 (His 187 in BsUDG) to move to a catalytic 
location (11). Ser270 (Serl89 in BsUDG) and Pro271 
(Pro 190 in BsUDG) mediate in this process by ensuring a 
correct uracil positioning through interactions with the 
preceding nucleotide base and subsequent nucleotide phos- 
phate, respectively. 

Unexpectedly, the substitutions of the selected residues 
clustered in loop V notably reduce the capacity of binding 



p56 without significantly altering the DNA-binding ability 
of the enzyme. The biochemical properties of the inter- 
action between these critical BsUDG residues and their 
counterparts in p56 provide a plausible explanation for 
the fact that UDG fits its inhibitor more tightly than the 
DNA substrate, as protein p56 is able to dissociate pre- 
formed UDG-DNA complexes (24). Positively charged 
BsUDG amino acids, such as Lys85 from loop II and 
Argl66 from loop IV, seemed to be more relevant 
than loop V residues for binding to the DNA substrate 
(Figure 5B). 

Our studies suggest that the two helical regions of the p56 
dimer (a la and alb) fit into the cavities where both DNA 
strands are proposed to bind to BsUDG (Figure 7C). 
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Ugi reaches the equivalent region in HsVDG and EcVDG 
through its (31 strand, a structure with an uncommon shape 
that resembles the bent twist of the DNA in complex with 
UDG (19,46). However, neither p56 nor Ugi interact 
directly with residues of the UDG uracil-binding pocket 
(19). Based on these results, we propose that protein p56 
might target the conserved mechanisms for uracil flipping 
and amino acid intercalation into the base stack. Thus, p56 
probably mimics just a short DNA fragment, as UDGs 
only seem to interact with the extrahelical uracil and the 
phosphate backbone flanking the flipped base during the 
catalytic activity. Previously, it has been suggested that 
protein Ugi could target the same step in the enzymatic 
reaction for UDG inhibition (46). Therefore, both inhibi- 
tors possibly prevent non-specific cross-reactive events with 
other DNA-binding enzymes by targeting the specific mode 
of action used mainly by UDGs. 

The binding to uracil-containing DNA induces a con- 
formational change in UDGs from an 'open' to a 'closed' 
state that generates the enzyme active site (42). This 
movement in HsUDG is described as a 10° rotation (38). 
On the contrary, only discrete and local structural changes 
occur in BsUDG and p56/Ugi proteins on complex forma- 
tion, suggesting that the shape and electrostatic comple- 
mentarity is pre-existing [Figure 3; (46)]. This is 
consistent with the tighter binding of p56/Ugi to UDG 
than DNA. As expected for DNA mimic proteins, p56 
and Ugi regions implicated in UDG recognition exhibit a 
negative electrostatic potential, compatible with the posi- 
tively charged surface of the UDG interface [Figure ID; 
(1 1)]. In agreement with these structural data, mutagenesis 
analysis of p56 acidic residues located in the complex inter- 
face have confirmed the key role of Asp35 and Asp45 (this 
work), and Glu26, Glu37 and Aspl8/Aspl9/Asp20 (39) in 
complex formation. Previous studies have also shown that 
the mutation of two DNA phosphate-mimicking Ugi carb- 
oxylate groups, Glu20 and Glu28, impeded the formation 
of a stable complex with UDG, and thus, the enzyme 
activity was not completely inhibited (48). 

Our results provide insights into the different behaviour 
of the two inhibitors, as £cUDG-Ugi and HsVDG- 
Ugi complexes generate a larger and more specific inter- 
face in addition to a more favourable binding energy 
on their formation than the 2faUDG-p56 complex 
(Supplementary Table S5). Moreover, the recognition 
carried out by Ugi of the protruding UDG residue 
seemed to occur over a larger binding surface compared 
with that in p56. Remarkably, a comparison between the 
experimental enzyme-inhibitor structures with several the- 
oretical models of p56 and Ugi binding to the same UDG 
isoform, suggested that higher polar and electrostatic 
complementarity was found in UDG-Ugi complexes 
(Supplementary Methods and Supplementary Table S5). 
Both inhibitors show an expected enhanced fit to UDG 
from their natural host, B. subtilis, probably as a result of 
an increase in the number of polar interactions and espe- 
cially, ion-pairs contacts (Supplementary Table S5). 

Altogether, our results provide new insights in under- 
standing the basis for the binding specificity of DNA 
mimic proteins, and they could also contribute to the 



design of selective UDG inhibitors with different 
properties. 
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